Using Domain Knowledge Provided by Ontologies for Improving Data Quality Management

نویسندگان

  • Stefan Brueggemann
  • Fabian Gruening
چکیده

Several data quality management (DQM) tasks like duplicate detection or consistency checking depend on domain specific knowledge. Many DQM approaches have potential for bringing together domain knowledge and DQM metadata. We provide an approach which uses this knowledge modeled in ontologies instead of aquiring that knowledge by cost-intensive interviews with domain-experts. These ontologies can directly be annotated with DQM specific metadata. With our approach a synergy effect can be achieved when modeling a domain ontology, e.g. for defining a shared vocabulary for improved interoperability, and performing DQM. We present three DQM applications which directly use knowledge provided by domain ontologies. These applications use the ontology structure itself to provide correction suggestions for invalid data, identify duplicates, and to store data quality annotations at schema and instance level.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards a Data Model for Quality Management Web Services: An Ontology of Measurement for Enterprise Modeling

Though the WWW is used for business process automation to lower costs and shorten leadtimes, arguably its use has been limited for another metric of business success: Improving quality. A promising advancement to the WWW is the development of the Semantic Web, which relies upon using machine process-able domain knowledge represented in ontologies. Therefore, one promising area of research and a...

متن کامل

A Knowledge Representation Formalism for Semantic Business Process Management

Business process models are increasingly used to create clarity about the logical sequence of activities in public and private organizations belonging to different industries and areas. To improve Business Process Management (BPM), semantic technologies (like ontologies, reasoners, and semantic Web services) should be integrated in BPM tools in order to enable semantic BPM. Semantic Business Pr...

متن کامل

Evaluation and ranking of selected hospitals in Mashhad in terms of quality of services provided by the method of FAHP and GRA-TOPSIS

Background: Assessing and improving the quality of services in hospitals because deal with the health of humans is very important. The purpose of this study is to identify and weigh quality criteria and ranking of four hospitals in Mashhad.   Materials & Methods: The present study is of type  Applied Studies  that is a cross-sectional study conducted in the winter of 1396. In this study, by l...

متن کامل

Text categorization using automatically acquired domain ontology

In this paper, we describe ontology-based text categorization in which the domain ontologies are automatically acquired through morphological rules and statistical methods. The ontology-based approach is a promising way for general information retrieval applications such as knowledge management or knowledge discovery. As a way to evaluate the quality of domain ontologies, we test our method thr...

متن کامل

Use of Existing Ontologies as Input for Structural Complexity Management - Reducing the Effort for Analysing and Improving Engineering Systems

This paper presents an approach for combining two actual trends in the engineering domain: ontology-based knowledge management and structural complexity management. A focussed engineering system can be analysed and possibilities for improvements can be deduced with low effort by applying structure based algorithms on already existing ontologies. An overview of the current use of ontologies in t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008